Exploiting Partial Operand Knowledge

نویسندگان

  • Brian R. Mestan
  • Mikko H. Lipasti
چکیده

Conventional microprocessor designs treat register operands as atomic units. In such designs, no portion of an operand may be consumed until the entire operand has been produced. In practice, logic circuits and arithmetic units that generate some portion of an operand in advance of the remaining portions are both feasible and desirable, and have been employed in several existing designs. In this report, we propose to exploit the early partial knowledge of an instruction’s input operands for overlapping the execution of dependent instructions and resolving unknown dependences. In particular, four applications of partial operand knowledge are examined: bypassing partial operands, resolving mispredicted conditional branches, disambiguating loads from earlier stores, and performing partial tag matching in set-associative caches. We find that each of these is feasible with limited knowledge of input operands. With the goal of fully exploiting this characteristic, we propose and evaluate a bit-sliced microarchitecture that decomposes a processors’s data path into 16and 8-bit slices. We show that a bit-sliced design which uses partial operand knowledge can outperform a conventional design with pipelined functional units, and can achieve IPC only slightly lower than a best-case non-pipelined design running at the same clock frequency. Specifically, we find that a bit-slice design using two 16-bit slices achieves IPC within 1% of an ideal design with non-pipelined functional units, representing a 16% speed-up over a conventional pipelined design.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optical Implementation of a Single-Iteration Thresholding

Threshold (or relative magnitude) search is traditionally performed iteratively in a bit-serial manner in optical database/knowledge-base machines which results in an execution time proportional to the operand size. We present in this paper a single-step threshold search algorithm and its optical implementation. The proposed algorithm performs magnitude comparison in constant time, independent ...

متن کامل

On the Design of Fast , Easily Testable ALU

A design methodology for implementing fast, easily testable arithmetic-logic units (ALU’s) is presented. Here, we describe a set of fast adder designs, which are testable with a test set that has either ( ) complexity (Lin-testable) or (1) complexity (C-testable), where is the input operand size of the ALU. The various levels of testability are achieved by exploiting some inherent properties of...

متن کامل

Amorphous Infrastructure for Language Implementation

We propose a method for the robust implementation of simple graphical automata on an amorphous computer. This infrastructure is applied to the implementation of purely functional programming languages. Specifically, it is used in conjunction with data-flow techniques to implement a toy language homologous to recurrence equations, exploiting control-flow parallelism through parallel operand eval...

متن کامل

Optical implementation of a single-iteration thresholding algorithm with applications to parallel data-base/knowledge-base processing.

Threshold (or relative magnitude) search is traditionally performed iteratively in a bit-serial manner in optical data-base/knowledge-base machines, which results in an execution time proportional to the operand size. We present a single-step threshold search algorithm and its optical implementation. The proposed algorithm performs magnitude comparison in constant time, independent of the opera...

متن کامل

Oware: Operand width Aware Redundant Execution for Whole-Processor Error Detection

As the feature size of semiconductor technology continues to shrink, high-performance microprocessors are increasingly susceptible to soft errors. Exploiting the fact that narrow-width values universally exist in applications, prior in-register duplication approaches for improving reliability of register file and other data-holding components mitigate performance cost but leave the rest of data...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003